618 results found.
Written
Corpus
Language Type:
Bilingual
Languages:
English German
Availability:
Freely Available
License:
Unspecified
Size:
194K sentence pairs
Production Status:
Existing-used
Use:
Machine Translation, SpeechToSpeech Translation
- Paper title: Investigating Catastrophic Forgetting During Continual Training for Neural Machine Translation
- Paper track: Long paper
- Paper status: Accept Poster
| Author Number | Name | Affiliation | Country |
|---|---|---|---|
| Main Contact | Shuhao Gu | IWSLT 2015 Data | /N |
Documentation:
I don't know.
Written
Corpus
Language Type:
Multilingual
Languages:
English French German
Availability:
Freely Available
License:
Unspecified
Size:
4.5M en-de + 0.6M en-fr sentence pairs
Production Status:
Existing-used
Use:
Machine Translation, SpeechToSpeech Translation
- Paper title: Investigating Catastrophic Forgetting During Continual Training for Neural Machine Translation
- Paper track: Long paper
- Paper status: Accept Poster
| Author Number | Name | Affiliation | Country |
|---|---|---|---|
| Main Contact | Shuhao Gu | WMT14 Data | /N |
Documentation:
I don't know.
Written
Corpus
Language Type:
Multilingual
Languages:
German Hindi Italian Spanish Swedish
Availability:
Freely Available
License:
OpenSource
Size:
184880 sentences
Production Status:
Existing-updated
Use:
Parsing and Tagging
- Paper title: Semi-Supervised Dependency Parsing with Arc-Factored Variational Autoencoding
- Paper track: Long paper
- Paper status: Accept Oral
| Author Number | Name | Affiliation | Country |
|---|---|---|---|
| Main Contact | Ge Wang | Universal Dependencies | /N |
Documentation:
None
Written
Corpus
Language Type:
Multilingual
Languages:
English French German Japanese
Availability:
Freely Available
License:
Creative Commons Attribution-ShareAlike 4.0 International License
Size:
765 MByte
Production Status:
Use:
Information Extraction, Information Retrieval
- Paper title: Embedding Meta-Textual Information for Improved Learning to Rank
- Paper track: Long paper
- Paper status: Accept Poster
| Author Number | Name | Affiliation | Country |
|---|---|---|---|
| Main Contact | Shigehiko Schamoni | MetaCLIR: Meta-Textual Information for Cross-lingual Information Retrieval | /N |
Documentation:
None
Speech/Written
Corpus
Language Type:
Bilingual
Languages:
English German
Availability:
Freely Available
License:
Creative Commons Attribution
Size:
None
Production Status:
Existing-updated
Use:
Corpus Creation/Annotation
- Paper title: Grammatical error detection in transcriptions of spoken English
- Paper track: Long paper
- Paper status: Accept Poster
| Author Number | Name | Affiliation | Country |
|---|---|---|---|
| Main Contact | Andrew Caines | CrowdED Corpus | /N |
Documentation:
None
Written
Corpus
Language Type:
Multilingual
Languages:
English German Polish Spanish
Availability:
Freely Available
License:
Gnu
Size:
17 GByte
Production Status:
Newly created-finished
Use:
Natural Language Generation
- Paper title: The ApposCorpus: a new multilingual, multi-domain dataset for factual appositive generation
- Paper track: Long paper
- Paper status: Accept Oral
| Author Number | Name | Affiliation | Country |
|---|---|---|---|
| Main Contact | Yova Kementchedjhieva | ApposCorpus | /N |
Documentation:
None
Written
Corpus
Language Type:
Multilingual
Languages:
Belarusian English Galician German Slovak Slovenian
Availability:
Freely Available
License:
Size:
3400000 tokens
Production Status:
Existing-used
Use:
Machine Translation, SpeechToSpeech Translation
- Paper title: Optimizing Transformer for Low-Resource Neural Machine Translation
- Paper track: Short paper
- Paper status: Accept Oral
| Author Number | Name | Affiliation | Country |
|---|---|---|---|
| Main Contact | Ali Araabi | IWSLT 2014 | /N |
Documentation:
None
Written
Evaluation Data
Language Type:
Monolingual
Languages:
German
Availability:
Freely Available
License:
CC-BY (Creative Commons Attribution 4.0 International)
Size:
18,502 sentences
Production Status:
Newly created-finished
Use:
Evaluation/Validation
- Paper title: When Beards Start Shaving Men: A Subject-object Resolution Test Suite for Morpho-syntactic and Semantic Model Introspection
- Paper track: Long paper
- Paper status: Accept Poster
| Author Number | Name | Affiliation | Country |
|---|---|---|---|
| Main Contact | Patricia Fischer | SORTS - A Subject-Object Resolution Test Suite of German minimal sentence pairs for model introspection | /N |
Documentation:
None
Multimodal/Multimedia
Corpus
Language Type:
Multilingual
Languages:
Czech English French German
Availability:
Freely Available
License:
CreativeCommons
Size:
31014 sentences
Production Status:
Existing-used
Use:
Machine Translation, SpeechToSpeech Translation
- Paper title: Supervised Visual Attention for Multimodal Neural Machine Translation
- Paper track: Long paper
- Paper status: Accept Poster
| Author Number | Name | Affiliation | Country |
|---|---|---|---|
| Main Contact | Tetsuro Nishihara | Multi30k | /N |
Documentation:
None
Written
Corpus
Language Type:
Multilingual
Languages:
English French German Italian Portuguese Spanish
Availability:
Freely Available
License:
CreativeCommons
Size:
multilingual word embeddings in 30 languages and 110 bilingual dictionaries
Production Status:
Existing-used
Use:
Machine Translation, SpeechToSpeech Translation
- Paper title: A Locally Linear Procedure for Word Translation
- Paper track: Short paper
- Paper status: Accept Poster
| Author Number | Name | Affiliation | Country |
|---|---|---|---|
| Main Contact | Soham Dan | MUSE | /N |
Documentation:
https://github.com/facebookresearch/MUSE/blob/master/README.md




